14 research outputs found

    Joint Synthesis of Safety Certificate and Safe Control Policy using Constrained Reinforcement Learning

    Full text link
    Safety is the major consideration in controlling complex dynamical systems using reinforcement learning (RL), where the safety certificate can provide provable safety guarantee. A valid safety certificate is an energy function indicating that safe states are with low energy, and there exists a corresponding safe control policy that allows the energy function to always dissipate. The safety certificate and the safe control policy are closely related to each other and both challenging to synthesize. Therefore, existing learning-based studies treat either of them as prior knowledge to learn the other, which limits their applicability with general unknown dynamics. This paper proposes a novel approach that simultaneously synthesizes the energy-function-based safety certificate and learns the safe control policy with CRL. We do not rely on prior knowledge about either an available model-based controller or a perfect safety certificate. In particular, we formulate a loss function to optimize the safety certificate parameters by minimizing the occurrence of energy increases. By adding this optimization procedure as an outer loop to the Lagrangian-based constrained reinforcement learning (CRL), we jointly update the policy and safety certificate parameters and prove that they will converge to their respective local optima, the optimal safe policy and a valid safety certificate. We evaluate our algorithms on multiple safety-critical benchmark environments. The results show that the proposed algorithm learns provably safe policies with no constraint violation. The validity or feasibility of synthesized safety certificate is also verified numerically.Comment: 24 pages, 8 figures, accepted for oral presentation at L4DC 202

    What Truly Matters in Trajectory Prediction for Autonomous Driving?

    Full text link
    In the autonomous driving system, trajectory prediction plays a vital role in ensuring safety and facilitating smooth navigation. However, we observe a substantial discrepancy between the accuracy of predictors on fixed datasets and their driving performance when used in downstream tasks. This discrepancy arises from two overlooked factors in the current evaluation protocols of trajectory prediction: 1) the dynamics gap between the dataset and real driving scenario; and 2) the computational efficiency of predictors. In real-world scenarios, prediction algorithms influence the behavior of autonomous vehicles, which, in turn, alter the behaviors of other agents on the road. This interaction results in predictor-specific dynamics that directly impact prediction results. As other agents' responses are predetermined on datasets, a significant dynamics gap arises between evaluations conducted on fixed datasets and actual driving scenarios. Furthermore, focusing solely on accuracy fails to address the demand for computational efficiency, which is critical for the real-time response required by the autonomous driving system. Therefore, in this paper, we demonstrate that an interactive, task-driven evaluation approach for trajectory prediction is crucial to reflect its efficacy for autonomous driving

    An Effective Data Sharing Scheme Based on Blockchain in Vehicular Social Networks

    No full text
    Vehicular social networks (VSNs) are the vehicular ad hoc networks (VANETs) that integrate social networks. Compared with traditional VANETs, VSNs are more suitable to serve a group of vehicles with common interests. In VSNs, vehicles can upload the necessary data in the cloud service provider (CSP) and other vehicles can query the data they are interested in through CSP, which enables VSNs to provide more user-friendly services. However, due to the wireless network communication environment, the data sent by the vehicle can easily be monitored. Adversaries are able to violate the privacy of the vehicle based on the collected data, thereby threatening the security of the entire network. In addition, if a vehicle shares malicious or false data with other vehicles, it is easy to mislead drivers and even cause serious traffic accidents. This paper proposes an effective data sharing scheme based on blockchain in VSNs. By integrating an identity based signature mechanism and pseudonym generation mechanism, we first propose an anonymous authentication mechanism as the basis for establishing trust relationships before data transmission between entities in VSNs. Then, a data sharing scheme based on blockchain is described, in which the signature mechanism and the consensus mechanism guarantee the security and traceability of data. The result of the performance analysis and the simulation experiment indicate that VAB can achieve a favourable performance compared with existing schemes

    Design of a loudspeaker array for personal audio in a car cabin

    No full text
    With an increasing interest in personal audio systems, the car cabin is an important environment in which to generate different audio programs in different regions, without the use of headphones. Two algorithms, acoustic contrast control and the least squares method, are considered for producing two independent listening zones, one zone including the front passengers and the other including the rear passengers. The generation of an acceptable acoustic contrast between the front and rear zones, using an array of four standard audio loudspeakers, is limited to relatively low frequencies. In order to provide acceptable control over a larger audio bandwidth, a loudspeaker array mounted on the ceiling of a car cabin is investigated in this paper. A selection process for the configuration of the source array is described. Free field simulations are used to calculate the response of the source array and investigate the performance of the two control algorithms. Since the performance of the least squares method is dependent on the chosen target sound pressures, a method for selecting the target sound pressures is also proposed. Finally, the proposed loudspeaker array is implemented in a real car and the measured results are found to be similar to those predicted from the simulations

    Real-Time Center of Gravity Estimation for Intelligent Connected Vehicle Based on HEKF-EKF

    No full text
    The vehicle center of gravity estimation is the key technology to the vehicle active safety system in intelligent connected vehicles. In this study, an integrated estimation approach for center of gravity (CG) combining Huber Extended Kalman Filter and Extended Kalman Filter (HEKF-EKF) is proposed. First, HEKF algorithm is used to estimate the distance between the CG and the front axle at the current time. Then, the CG height obtained by HEKF and EKF algorithms is weighted to obtain the optimal estimate value. Finally, the results show that the algorithm’s estimation convergence time is 2 s, its longitudinal position estimation error is less than 2%, and its center of gravity height estimation error is less than 3%. The longitudinal and vertical positions of the vehicle CG can be accurately estimated using this method. This method can help advance the development of active safety technology

    Zeroth-Order Actor-Critic

    Full text link
    The recent advanced evolution-based zeroth-order optimization methods and the policy gradient-based first-order methods are two promising alternatives to solve reinforcement learning (RL) problems with complementary advantages. The former methods work with arbitrary policies, drive state-dependent and temporally-extended exploration, possess robustness-seeking property, but suffer from high sample complexity, while the latter methods are more sample efficient but are restricted to differentiable policies and the learned policies are less robust. To address these issues, we propose a novel Zeroth-Order Actor-Critic algorithm (ZOAC), which unifies these two methods into an on-policy actor-critic architecture to preserve the advantages from both. ZOAC conducts rollouts collection with timestep-wise perturbation in parameter space, first-order policy evaluation (PEV) and zeroth-order policy improvement (PIM) alternately in each iteration. We extensively evaluate our proposed method on a wide range of challenging continuous control benchmarks using different types of policies, where ZOAC outperforms zeroth-order and first-order baseline algorithms

    Effect of cesium ion on the synthesis and catalytic properties with FeCo Prussian blue analogue

    No full text
    International audienceIn this article, cesium ions were introduced into the FeCo Prussian blue analogues (PBA) catalyst to increase the styrene conversion and the selectivity in styrene oxide in the reaction of epoxidation of styrene. The conversion of styrene over CsFeCoPBA-1 on the optimal condition reach up to 96%, in comparison to 88% over FeCoPBA. The catalysts were characterized by fourier transform infrared spectroscopy, X-ray diffraction, transmission electron microscopy (TEM), X-ray energy dispersive spectroscopy (EDS), thermogravimetric analysis (TG) and N2 adsorption/desorption, respectively. To further illustrate the phenonmen, the detailed kinetic studies on epoxidation of styrene were carried out. It is found that the rate equations is similar between the catalyst CsFeCoPBA-1 and FeCoPBA, while the reaction activation energy with CsFeCoPBA-1 catalyst is reduced obviously. Meanwhile, isothermal titration calorimetry (ITC) had also been employed to directly probe thermodynamic changes during the formation of the PBA with or without Cs ions. It is noteworthy that higher enthalpy change in the presence of cesium ions
    corecore